Abstract


The severe acute respiratory syndrome-coronavirus nucleocapsid (N) protein is involved in virus replication and modulation of cell
processes. In this latter respect control may in part be achieved through the sub-cellular localisation of the protein. N protein
predominately localises in the cytoplasm (the site of virus replication and assembly) but also in the nucleus/nucleolus. Using a combination of
live-cell and confocal microscopy coupled to mutagenesis we identified a cryptic nucleolar localisation signal in the central part of the N
protein. In addition, based on structural comparison to the avian coronavirus N protein, a nuclear export signal was identified in the
C-terminal region of the protein.
© 2007 Published by Elsevier Inc. 
The coronavirus and related arterivirus nucleocapsid
(N) protein can localise to both the cytoplasm and the 
nucleolus [1-4], and may be cell cycle-related [5]. 
Although the principal site of replication for both viruses 
is the cytoplasm, interaction with the nucleolus is an 
emerging paradigm for RNA viruses [6]. The nucleolus 
is a dynamic sub-nuclear structure involved in ribosome 
subunit biogenesis, RNA processing, control of cell 
growth and response to cell stress [7-9]. Trafficking 
between the two compartments may be determined by 
appropriate signals [10]. N protein is a phosphoprotein 
[11,12] with roles in replication and interacting with host 
cell processes. Three conserved regions (I, II and III) can 
be identified in the coronavirus N protein, and may play 
a role in different functions such as RNA binding [13]. 
The coronavirus infectious bronchitis virus (IBV) N 
protein contains an eight-amino acid nucleolar localisa- 
tion signal (NoLS) in region 1 [14], and the arterivirus 
porcine reproductive and respiratory syndrome virus N 
protein contains an NLS which is required for efficient 
replication [15]. If the N proteins traffic to the nucleolus 
then they must contain appropriate signals in order to 
return to the cytoplasm, to play a principal role in virus
biology. Indeed IBV N protein contains a non-CRM1 
dependent nuclear export signal (NES) in region 3 
[14,16]. 

The severe acute respiratory syndrome coronavirus 
(SARS-CoV) N protein has also been reported to local- 
ise to the nucleolus/nucleus [17-19]. However, nucleolar 
localisation was not observed in virus infected cells 
[20,21]. Over-expression of SARS-CoV N protein indicated
that its nucleolar localisation was lower than that
of IBV N protein [20]. However, SARS-CoV N protein
could be translocated to the cytoplasm from the nucleus 
via binding to the 14-3-3 protein [22]. Taken together 
the data suggested that the SARS-CoV N protein may 
contain a strong NES. The signals which mediate the 
potential trafficking of SARS-CoV N protein are
unknown [20]. Deletion mutagenesis revealed that
regions 1 and 2 could direct a fluorescent fusion protein
to the nucleus and nucleus/nucleolus, respectively.
Region 3 could direct a fusion protein to the cytoplasm 
[20]. The predicted NES in region 2 was also found to 
be non-functional [20]. This study investigated the 
potential signals involved in trafficking of the SARS- 
CoV N protein to the nucleolus and its export from 
the nucleus. 
Materials and methods


Plasmids. The expression cDNAs used in this study were generated by
PCR and cloning into pEGFP-C2 and pECFP-C1 (Clontech) (which
express the fusion protein C-terminal of enhanced green and cyan
fluorescent proteins, respectively), utilising non-templated restriction sites in
sequence-specific primers [14,23]. The plasmids generated were:
pECFP-SARSCoV-NIIAB, pECFP-SARSCoV-NIIBC, pEGFP-SARSCoV-NIIA,
pEGFP-SARSCoV-NIIB and pEGFP-SARSCoV-NIIC. The
potential NES in SARS-CoV N protein was deleted by overlapping PCR
using primers NewNESfor and NewNESrev (details of primers available
on request). All constructs were verified by sequencing in both directions
and Western blot (data not shown).

Transfection and imaging. Cos7 cells were grown and transfected using 
Lipofectamine (Invitrogen) as described previously [20,23]. Live-cell 
imaging was performed on a Nikon Eclipse TS 100 microscope utilising 
the appropriate filter for EGFP and ECFP. Fluorescence and bright-field 
images were captured. Cells were processed for confocal microscopy as 
described [14,23]. 


Results and discussion 


SARS-CoV N protein region II contains motifs capable of 
directing nucleolar localisation 


To investigate whether SARS-CoV N protein contained 
motifs capable of directing nucleolar localisation, deletion 
mutagenesis was performed on region II to delineate any
potential amino acids involved in this process. Previously, 
we have shown that region II (amino acids 157-299) can 
direct ECFP to the nucleolus [20]. Following the approach 
which was used to identify the NoLS in IBV N protein [14], 
region II of SARS-CoV N protein was subdivided into five
fragments (AB, BC, A, B and C), which spanned amino
acids 157-246, 210-299, 157-209, 210-247 and 247-299,
respectively. These fragments were cloned C-terminal of
either the ECFP or the EGFP reporter protein,
creating plasmids pECFP-SARSCoV-NIIAB,
pECFP-SARSCoV-NIIBC, pEGFP-SARSCoV-NIIA,
pEGFP-SARSCoV-NIIB and pEGFP-SARSCoV-NIIC, which
when transfected into mammalian cells led to the expres- 
sion of the appropriate fusion protein. The localisation of 
these proteins was compared to region II which had been 
previously cloned downstream of ECFP (referred to in this 
study as pECFP-SARSCoV-NII [20]), at 24-h post-transfection
using both live-cell and confocal microscopy (note that
we have shown previously that EGFP, ECFP and DsRed
localise to the cytoplasm and nucleus with no preferential
accumulation in either compartment [14,16,20]). The data
indicated that ECFP-SARSCoV-NII protein localised predominately
to the nucleolus with some nuclear localisation
(as described previously [20]) when analysed using live-cell
and confocal microscopy (Fig. 1). Relative fluorescent
intensity analysis of ECFP-SARSCoV-NII and the nucleolar
marker protein B23.1 (plasmid pDsRedB23.1 [20]) indicated
that ECFP-SARSCoV-NII was predominately
nucleolar (Fig. 1).

Fig. 1. Live-cell imaging of the distribution of ECFP-SARSCoV-NII protein in Cos7 cells; fluorescent and bright-field views of the same image are presented, and the
nucleolus is indicated. Below is shown a confocal image of the nucleus of a Cos7 cell expressing ECFP-SARSCoV-NII protein (green) and DsRedB23.1
(red). A merged image is also presented and any co-localisation is shown in yellow. A nucleolus is indicated. In the merged image the line is 10 µm. Relative
fluorescent intensity analysis of the distribution of ECFP-SARSCoV-NII protein (green) and DsRedB23.1 (red) in the nucleus. The white line on the
merged image to the left is the region analysed. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version
of this paper.)

Fig. 2. Relative fluorescent intensity analysis of the distribution of fragments of region II of SARS-CoV N protein fused C-terminal of ECFP/EGFP
(green) (described in the text and indicated in the appropriate image) and DsRedB23.1 (red). The white line on the merged confocal image to the left is the
region analysed. For aid, the red and blue dots represent the vertical red and blue lines for orientation. (For interpretation of the references to colour in
this figure legend, the reader is referred to the web version of this paper.)
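
The relative fluorescent intensity analysis along a line drawn across a merged image (Figs. 1 and 2) can be illustrated with a minimal sketch. It assumes each channel is available as a 2D grid of pixel intensities and that "relative" means scaling each channel to its own peak along the line; the function names and the toy 5 x 5 images are hypothetical, and the imaging software actually used for the analysis is not described here.

```python
from math import hypot

def line_profile(image, start, end):
    """Sample pixel values along a straight line using nearest-neighbour lookup.

    image is a list of rows; start and end are (row, column) tuples.
    """
    (r0, c0), (r1, c1) = start, end
    n = int(round(hypot(r1 - r0, c1 - c0))) + 1  # about one sample per pixel
    if n == 1:
        return [image[r0][c0]]
    profile = []
    for i in range(n):
        t = i / (n - 1)
        r = int(round(r0 + t * (r1 - r0)))
        c = int(round(c0 + t * (c1 - c0)))
        profile.append(image[r][c])
    return profile

def relative(profile):
    """Scale a profile to its own maximum (assumed meaning of 'relative')."""
    peak = max(profile)
    return [v / peak if peak else 0.0 for v in profile]

# Toy data (hypothetical): both channels peak at the same central pixel.
green = [[0, 0, 0, 0, 0],
         [0, 1, 2, 1, 0],
         [0, 2, 8, 2, 0],
         [0, 1, 2, 1, 0],
         [0, 0, 0, 0, 0]]
red = [[0, 0, 0, 0, 0],
       [0, 1, 3, 1, 0],
       [0, 3, 9, 3, 0],
       [0, 1, 3, 1, 0],
       [0, 0, 0, 0, 0]]

green_profile = relative(line_profile(green, (2, 0), (2, 4)))
red_profile = relative(line_profile(red, (2, 0), (2, 4)))
print(green_profile)  # [0.0, 0.25, 1.0, 0.25, 0.0]
```

Coincident peaks in the two profiles would correspond to the co-localisation of the fusion protein with the nucleolar marker described in the text.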

To investigate whether region II contained a potential
NoLS, cells were co-transfected with pDsRedB23.1 and
one of pECFP-SARSCoV-NIIAB, pECFP-SARSCoV-NIIBC,
pEGFP-SARSCoV-NIIA, pEGFP-SARSCoV-NIIB or
pEGFP-SARSCoV-NIIC and analysed 24-h post-transfection
using confocal microscopy and quantitative
fluorescence (Fig. 2). The data indicated that
ECFP-SARSCoV-NIIAB localised predominately to the
cytoplasm and the nucleolus, whereas ECFP-SARSCoV-NIIBC
localised predominately to the nucleus with some
cytoplasmic localisation. EGFP-SARSCoV-NIIA and
EGFP-SARSCoV-NIIB, similar to ECFP-SARSCoV-NIIAB,
localised predominately to the cytoplasm and
nucleolus. EGFP-SARSCoV-NIIA also co-localised with
DsRedB23.1, pointing to nucleolar localisation.
EGFP-SARSCoV-NIIC localised to the cytoplasm and nucleus.
If no trafficking signals were present, then the fusion
protein would be distributed evenly between the
cytoplasm and nucleus. These data indicated that there
were no defined short amino acid sequences that directed
nucleolar localisation, unlike the eight-amino acid
motif in IBV N protein [14]. None of these
fragments reproduced the predominately nucleolar
localisation observed when ECFP was fused to region II (Fig. 1),
suggesting that a cryptic NoLS(s) may encompass
SARS-CoV N protein region II.

Nucleolar localisation of SARS-CoV N protein may
therefore be a case of molecular mimicry, in which amino
acids are ordered such that they resemble an NoLS [24],
which could account for the low frequency of observation [20].
Conversely, many NoLSs are larger than eight amino acids; for
example, that of the survivin-ΔEx3 protein is 17 amino acids [25]
and the herpes saimiri virus ORF57 protein contains two
distinct nuclear localisation signals which together act as
an NoLS [26]. Analysis of the human nucleolar proteome
suggested that certain amino acids and motifs are over- and
under-represented [9]. For example, nucleolar proteins
contain more lysine and arginine residues than generic cellular
proteins (12%). In SARS-CoV N protein, by weight KR makes up
17.5%, 20.21% and 16.12% of regions 1, 2 and 3,
respectively. Overall KR contributes 17.98% of SARS-CoV N
protein by weight. Nucleolar proteins also appear to have
fewer cysteine and histidine residues than cellular or nuclear
proteins [9]. SARS-CoV N protein contains no cysteine
residues and 1.45% histidine by weight, whereas generic cellular
proteins contain approximately 2% and 3%, respectively. GRG
and RGG motifs are also frequent in nucleolar proteins
and one third of the human nucleolar proteome contains
an RGG motif at least once [9]. Amino acid sequence
analysis indicates that SARS-CoV N protein contains no GRG
motif but does contain two RGG motifs located at amino
acids 96-98 in region 1 and 178-180 in region 2 (and thus
in fragment A). However, SARS-CoV N protein region 1
directs a fusion protein to the nucleus but not nucleolus
[20]. Taken together, the presence of an RGG motif and a
higher percentage by weight of KR residues in region II
may account for any nucleolar localisation of the protein.
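
The composition and motif analysis above can be illustrated with a short sketch. It assumes that "by weight" means the summed average residue masses of the selected amino acids divided by the summed residue masses of the whole sequence; the exact method used for the figures quoted in the text is not stated. The function names and the toy sequence are hypothetical, and the sequence is not the SARS-CoV N protein. The fragment boundaries are those given earlier for region II.

```python
import re

# Average residue masses in daltons (amino acid minus one water molecule).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}

# Region II fragments as stated in the text (1-based, inclusive positions).
FRAGMENTS = {
    "AB": (157, 246), "BC": (210, 299),
    "A": (157, 209), "B": (210, 247), "C": (247, 299),
}

def percent_by_weight(seq, residues="KR"):
    """Percentage of the summed residue mass contributed by the given residues."""
    if not seq:
        raise ValueError("empty sequence")
    total = sum(RESIDUE_MASS[aa] for aa in seq)
    part = sum(RESIDUE_MASS[aa] for aa in seq if aa in residues)
    return 100.0 * part / total

def find_motif(seq, motif):
    """Return 1-based inclusive (start, end) for every occurrence, overlaps included."""
    pattern = "(?=" + re.escape(motif) + ")"
    return [(m.start() + 1, m.start() + len(motif)) for m in re.finditer(pattern, seq)]

def fragments_containing(start, end):
    """Names of the region II fragments that fully contain positions start..end."""
    return sorted(name for name, (lo, hi) in FRAGMENTS.items()
                  if lo <= start and end <= hi)

toy = "MSRGGKAGRGGL"  # hypothetical sequence for illustration only
print(find_motif(toy, "RGG"))          # [(3, 5), (9, 11)]
print(find_motif(toy, "GRG"))          # [(8, 10)]
print(fragments_containing(178, 180))  # ['A', 'AB']
```

With these boundaries, positions 178-180 fall within fragments A and AB, in line with the statement that the second RGG motif lies in fragment A.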


SARS-CoV N protein region III contains a novel NES 


Recently, we have shown that the avian coronavirus N
protein contains a leucine rich motif in the C-terminal
region of the protein which acts as an NES [14,16].
Therefore the hypothesis was tested that region III of
SARS-CoV N protein contained a motif which could act as an
NES. To identify such a motif the structure of the
C-terminal end of SARS-CoV N protein (amino acids 270-370)
[27] was compared with that of the C-terminal part of IBV N
protein (amino acids 220-333) [28] using the Deep View
structural alignment algorithm. The data indicated that
the two structures could be superimposed and that the
NES identified in the IBV N protein mapped to amino
acids 324-EVTPSGTWLT-334 on the SARS-CoV N
protein (Fig. 3). Amino acid sequence alignment using AlignX
(Vector NTI) in the context of the wild-type full-length
proteins indicated several conserved amino acids at this
region (Fig. 3). To investigate whether this motif was
involved in nuclear export of SARS-CoV N protein,
deletion mutagenesis was used to remove the motif in the
context of SARS-CoV N protein fused to ECFP, creating
plasmid pECFP-SARSCoV-NΔNES. Cells were co-transfected
with pDsRedB23.1 and either pECFP-SARSCoV-N [23]
(which expressed wild-type SARS-CoV N protein
C-terminally fused to ECFP) or pECFP-SARSCoV-NΔNES
and examined using live-cell and confocal microscopy
24-h post-transfection (Fig. 4). The data indicated
that, in contrast to ECFP-SARSCoV-N protein, which
localised to the cytoplasm with little signal in the nucleus
(Fig. 4), ECFP-SARSCoV-NΔNES localised to the
cytoplasm and nucleus (Fig. 4), which would be predicted if
nuclear export had been disrupted.
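
At the sequence level, the deletion described above amounts to locating the candidate motif and joining the flanking sequence. The following is a minimal in silico sketch only: the laboratory deletion was made by overlapping PCR, the function name is hypothetical, and the flanking residues in the toy sequence are placeholders rather than the SARS-CoV N protein sequence.

```python
def delete_motif(seq, motif):
    """Remove a single occurrence of motif from seq.

    Returns the shortened sequence and the 1-based inclusive (start, end)
    position that the motif occupied in the original sequence.
    """
    idx = seq.find(motif)
    if idx == -1:
        raise ValueError("motif not found")
    if seq.find(motif, idx + 1) != -1:
        raise ValueError("motif occurs more than once; deletion is ambiguous")
    deleted = seq[:idx] + seq[idx + len(motif):]
    return deleted, (idx + 1, idx + len(motif))

NES_MOTIF = "EVTPSGTWLT"                # motif sequence as quoted in the text
toy = "AAAA" + NES_MOTIF + "GGGG"       # hypothetical flanking residues
shortened, position = delete_motif(toy, NES_MOTIF)
print(shortened)  # AAAAGGGG
print(position)   # (5, 14)
```

Raising an error when the motif is absent or repeated keeps the sketch from silently producing a construct that differs from the intended one.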

Clearly, SARS-CoV N protein contains amino acid
motifs which would allow its trafficking to the nucleus.
Given that this protein is observed in the cytoplasm of
infected cells and in over-expression studies [21,23], the
possibility arises that either N protein is inefficiently imported
into the nucleus or efficiently exported. The former
possibility may occur through structural alteration of the
protein exposing the potential NoLS in region II, perhaps
through differential phosphorylation or multimerisation.

Fig. 3. Comparison of the structure of the C-terminal region of IBV and
SARS-CoV N protein (indicated). The IBV N protein NES motif is
indicated in yellow. Below is shown the sequence alignment of the relevant
SARS-CoV N protein sequence with the IBV N protein NES motif.
Arrows indicate key residues involved in the activity of the NES. (.)
indicates a conserved amino acid and (:) indicates an identical amino acid.
(For interpretation of the references to colour in this figure legend, the
reader is referred to the web version of this paper.)

Fig. 4. Live-cell and confocal microscopy of the distribution of ECFP-SARSCoV-N protein in cells. Also shown in the confocal image is the expression of
DsRedB23.1; a relative fluorescent intensity profile of this cell is also shown. Below are shown live-cell and confocal microscopy of the distribution of
ECFP-SARSCoV-NΔNES, together with the relative fluorescent intensity. Cells were also co-expressing DsRedB23.1.


Deletion and mutagenesis analysis in this and previous 
studies [20] indicates that the NES motif identified between 
amino acids 324-EVTPSGTWLT-334 in SARS-CoV N 
protein is the dominant signal in determining N protein 
localisation. This motif does not conform to a classical leu- 
cine-rich NES and thus represents a novel motif involved in 
nuclear export. 